SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: CEDARS-SINAI MEDICAL CENTER

(ii) TITLE OF INVENTION: NUCLEIC ACID ENCODING SPINOCEREBELLAR
ATAXIA-2 AND PRODUCTS RELATED THERETO

(iii) NUMBER OF SEQUENCES: 5

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Campbell & Flores LLP

(B) STREET: 4370 La Jolla Village Drive, Suite 700

(C) CITY: San Diego 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 92122 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Ramos, Robert T. 

(B) REGISTRATION NUMBER: 37,915

(C) REFERENCE/DOCKET NUMBER: FP CE 2563

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 535-9001

(B) TELEFAX: (619) 535-8949 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TTGGTAGCAA CGGAAACGGC GGCGGCGCGT TTCGGCCCGG CTCCCGGCGG CTCCTTGGTC 60
TCGGCGGGCC TCCCCGCCCC TTCGTCGTCG TCCTTCTCCC CCTCGCCAGC CCGGGCGCCC 120
CTCCGGCCGC GCCAACCCGC GCCTCCCCGC TCGGCGCCCG TGCGTCCCCG CCGCGTTCCG 180
GCGTCTCCTT GGCGCGCCCG GCTCCCGGCT GTCCCCGCCC GGCGTGCGAG CCGGTGTATG 240
GGCCCCTCAC CATGTCGCTG AAGCCCCAGC AGCAGCAGCA GCAGCAGCAG CAACAGCAGC 300
AGCAGCAACA GCAGCAGCAG CAGCAGCAGC AGCCGCCGCC CGCGGCTGCC AATGTCCGCA 360
AGCCCGGCGG CAGCGGCCTT CTAGCGTCGC CCGCCGCCGC GCCTTCGCCG TCCTCGTCCT 420
CGGTCTCCTC GTCCTCGGCC ACGGCTCCCT CCTCGGTGGT CGCGGCGACC TCCGGCGGCG 480
GGAGGCCCGG CCTGGGCAGG TCGGTOTCGG CACCCC 516



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4481 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 163..4101



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

ACCCCCGAGA AAGCAACCCA GCGCGCCGCC qcCTCCTCAC GTGTCCCTCC CGGCCCCGGG 60
GCCACCTCAC GTTCTGCTTC CGTCTGACCC cilCCGACTTC CGGTAAAGAG TCCCTATCCG 120
CACCTCCGCT CCCACCCGGC GCCTCGGCGC GCACGCCCTC CG ATG CGC TCA GCG 174
Met Arg Ser Ala
1

GCC GCA GCT CCT CGG AGT CCC GCG GTG GTC ACC GAG TCT CGC CGC TTC 222
Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu Ser Arg Arg Phe
5 10 15 20

GCC GCA GCC AGG TGG CCC GGG TGG CGC TCG CTC CAG CGG CCG GCG CGG 270
Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gln Arg Pro Ala Arg
25 30 35

CGG AGC GGG CGG GGC GGC GGT GGC GCG GCC CCG GGA CCG TAT CCC TCC 318
Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly Pro Tyr Pro Ser
40 45 50

GCC GCC CCT CCC CCG CCC GGC CCC GGC CCC CCT CCC TCC CGG CAG AGC 366
Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro Ser Arg Gln Ser
55 60 65



TCG CCT CCC TCC GCC TCA GAC TGT TTT GGT AGC AAC GGC AAC GGC GGC 414
Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn Gly Asn Gly Gly
70 75 80

GGC GCG TTT CGG CCC GGC TCC CGG CGG CTC CTT GGT CTC GGC GGG CCT 462
Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly Leu Gly Gly Pro
85 90 95 100

CCC CGC CCC TTC GTC GTC GTC CTT CTC CCC CTC GCC AGC CCG GGC GCC 510
Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala Ser Pro Gly Ala
105 110 115

CCT CCG GCC GCG CCA ACC CGC GCC TCC CCG CTC GGC GCC CGT GCG TCC 558
Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly Ala Arg Ala Ser
120 125 130

CCG CCG CGT TCC GGC GTC TCC TTG GCG CGC CCG GCT CCC GGC TGT CCC 606
Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala Pro Gly Cys Pro
135 140 145

CGC CCG GCG TGC GAG CCG GTG TAT GGG CCC CTC ACC ATG TCG CTG AAG 654
Arg Pro Ala Cys Glu Pro Val Tyr Gly Pro Leu Thr Met Ser Leu Lys
150 155 160

CCC CAG CAG CAG CAG CAG CAG CAG CAG CAA CAG CAG CAG CAG CAA CAG 702
Pro Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln
165 170 175 180

CAG CAG CAG CAG CAG CAG CAG CCG CCG CCC GCG GCT GCC AAT GTC CGC 750
Gln Gln Gln Gln Gln Gln Gln Pro Pro Pro Ala Ala Ala Asn Val Arg
185 190 195

AAG CCC GGC GGC AGC GGC CTT CTA GCG TCG CCC GCC GCC GCG CCT TCG 798
Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala Ala Ala Pro Ser
200 205 210

CCG TCC TCG TCC TCG GTC TCC TCG TCC TCG GCC ACG GCT CCC TCC TCG 846
Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr Ala Pro Ser Ser
215 220 225

GTG GTC GCG GCG ACC TCC GGC GGC GGG AGG CCC GGC CTG GGC AGA GGT 894
Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly Leu Gly Arg Gly
230 235 240

CGA AAC AGT AAC AAA GGA CTG CCT CAG TCT ACG ATT TCT TTT GAT GGA 942
Arg Asn Ser Asn Lys Gly Leu Pro Gln Ser Thr Ile Ser Phe Asp Gly
245 250 255 260

ATC TAT GCA AAT ATG AGG ATG GTT CAT ATA CtV ACA TCA GTT GTT GGC 990
Ile Tyr Ala Asn Met Arg Met Val His Ile Leu Thr Ser Val Val Gly
265 270 275

TCC AAA TGT GAA GTA CAA GTG AAA AAT GGA GGT ATA TAT GAA GGA GTT 1038
Ser Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile Tyr Glu Gly Val
280 285 290



TTT AAA ACT TAG AGT CCG AAG TGT GAT TTG GTA CTT GAT GCC GCA CAT 1086
Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala Ala His
295 300 305

GAG AAA AGT ACA GAA TCC AGT TCG GGG CCG AAA CGT GAA GAA ATA ATG 1134
Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu Ile Met
310 315 320

GAG AGT ATT TTG TTC AAA TGT TCA GAC TTT GTT GTG GTA CAG TTT AAA 1182
Glu Ser Ile Leu Phe Lys Cys Ser Asp Phe Val Val Val Gln Phe Lys
325 330 335 340

GAT ATG GAC TCC AGT TAT AAA AGA GAT GCT TTT ACT GAC TCT GCT 1230
Asp Met Asp Ser Ser Tyr Lys Arg Asp Ala Phe Thr Asp Ser Ala
345 350 355

ATC AGT GCT AAA GTG AAT GGC GAA CAC AAA GAG AAG GAC CTG GAG CCC 1278
Ile Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu Glu Pro
360 365 370

TGG GAT GCA GGT GAA CTC ACA GCC AAT GAG GAA CTT GAG GCT TTG GAA 1326
Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu Glu Ala Leu Glu
375 380 385

AAT GAC GTA TCT AAT GGA TGG GAT CCC AAT GAT ATG TTT CGA TAT AAT 1374
Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn
390 395 400

GAA GAA AAT TAT GGT GTA GTG TCT ACG TAT GAT AGC AGT TTA TCT TCG 1422
Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser
405 410 415 420

TAT ACA GTG CCC TTA GAA AGA GAT AAC TCA GAA GAA TTT TTA AAA CGG 1470
Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg
425 430 435

GAA GCA AGG GCA AAC CAG TTA GCA GAA ATT GAG TCA AGT GCC CAG 1518
Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln
440 445 450

TAG AAA GCT CGA GTG GCC CTG GAA AAT GA'A GAT AGG AGT GAG GAA GAA 1566
Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu
455 460 465

AAA TAG AGA GGA GTT CAG AGA AAT TCG AGT CGT GAG GGG CAC AGC 1614
Lys Tyr Thr Ala Val Gln Arg Asn Ser Ser Glu Arg Glu Gly His Ser
470 475 480

ATA AAG ACT AGG GAA AAT AAA TAT ATT CGT CGT GGA GAA AGA AAT AGA 1662
Ile Asn Thr Arg Glu Asn Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg
485 490 495 500

GAA GTG ATA TGG TGG GGA AGT GGG AGA CAG AAT TCA CCG GGT ATG GGG 1710
Glu Val Ile Ser Trp Gly Ser Gly Arg Gln Asn Ser Pro Arg Met Gly
505 510 515



CAG CCT GGA TCG GGC TCC ATG CCA TCA AGA TCC ACT TCT CAC ACT TCA 1758
Gln Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr Ser His Thr Ser
520 525 530

GAT TTC AAC CCG AAT TCT GGT TCA GAC CAA AGA GTA GTT AAT GGA GGT 1806
Asp Phe Asn Pro Asn Ser Gly Ser Asp Gln Arg Val Val Asn Gly Gly
535 540 545

GTT CCC TGG CCA TCG CCT TGT CCA TCT CCT TCC TCT CGC CCA CCT TCT 1854
Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser Arg Pro Pro Ser
550 555 560

CGC TAC CAG TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC ACC CCT 1902
Arg Tyr Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala Thr Pro
565 570 575 580

ACA CGG CCG CCC TCC AGG CCC CCC TCG CGG CCA TCC AGA CCC CCG TCT 1950
Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro Pro Ser
585 590 595

CAC CCC TCT GCT CAT GGT TCT CC GCT CCT GTC TCT ACT ATG CCT AAA 1998
His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met Pro Lys
600 605 610

CGC ATG TCT TCA GAA GGG CCT CCA AGG ATG TCC CCA AAG GCC CAG CGA 2046
Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala Gln Arg
615 620 625

CAT CCT CGA AAT CAC AGA GTT TCT GCT GGG AGG GGT TCC ATA TCC AGT 2094
His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser Ile Ser Ser
630 635 640

GGC CTA GAA TTT GTA TCC CAC AAC AGT GAA GCA GCT ACT CCT 2142
Gly Leu Glu Phe Val Ser His Asn Ser Glu Ala Ala Thr Pro
645 650 655 660

CCA GTA GCA AGG ACC AGT CCC TCG ACG TGG TCA TCA GTG GTC 2190
Pro Val Ala Arg Thr Ser Pro Ser Thr Trp Ser Ser Val Val
665 670 675

AGT GGG GTT CCA AGA TTA TCC CCT AAA ACT CAT AGA CCC AGG TCT CCC 2238
Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg Ser Pro
680 685 690

AGA CAG AAC AGT ATT GGA AAT ACC CCC AGT 3G CCA GTT CTT GCT TCT 2286
Arg Gln Asn Ser Ile Gly Asn Thr Pro Ser Gly Pro Val Leu Ala Ser
695 700 705

CCC CAA GCT GGT ATT ATT CCA ACT GAA GCT GTl GCC ATG CCT ATT CCA 2334
Pro Gln Ala Gly Ile Ile Pro Thr Glu Ala Val Ala Met Pro Ile Pro
710 715 720

GCT GCA TCT CCT ACG CCT GCT AGT CCT GCA TCG AAC AGA GCT GTT ACC 2382
Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala Val Thr
725 730 735 740



CCT TCT AGT GAG GCT AAA GAT TCC AGG CTT CAA GAT CAG AGG CAG AAC 2430
Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gln Asp Gln Arg Gln Asn
745 750 755

TCT CCT GCA GGG AAT AAA GAA AAT ATT AAA CCC AAT GAA ACA TCA CCT 2478
Ser Pro Ala Gly Asn Lys Glu Asn Ile Lys Pro Asn Glu Thr Ser Pro
760 765 770

AGC TTC TCA AAA GCT GAA AAC AAA GGT ATA TCA CCA GTT GTT TCT GAA 2526
Ser Phe Ser Lys Ala Glu Asn Lys Gly Ile Ser Pro Val Val Ser Glu
775 780 785

CAT AGA AAA CAG ATT GAT GAi TTA AAG AAA TTT AAG AAT GAT TTT AGG 2574
His Arg Lys Gln Ile Asp Asp Leu Lys Lys Phe Lys Asn Asp Phe Arg
790 795 800

TTA CAG CCA AGT TCT ACT TCT TCT ATG GAT CAA CTA CTA AAC AAA 2622
Leu Gln Pro Ser Ser Thr Ser Glu Ser Met Asp Gln Leu Leu Asn Lys
805 810 815 820

AAT AGA GAG GGA GAA AAA TCA AGA GAT TTG ATC AAA GAC AAA ATT GAA 2670
Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu Ile Lys Asp Lys Ile Glu
825 830 835

CCA AGT GCT AAG GAT TCT TTC ATT AAA AAT AGC AGC AGC AAC TGT ACC 2718
Pro Ser Ala Lys Asp Ser Phe Ile Glu Asn Ser Ser Ser Asn Cys Thr
840 845 850

AGT GGC AGC AGC AAG CCG AAT AGC Cci AGC ATT TCC CCT TCA ATA CTT 2766
Ser Gly Ser Ser Lys Pro Asn Ser Pro Ser Ile Ser Pro Ser Ile Leu
855 860 865

AGT AAC ACG GAG CAC AAG AGG GGA CCT GAG GTC ACT TCC CAA GGG GTT 2814
Ser Asn Thr Glu His Lys Arg Gly Pro Glu Val Thr Ser Gln Gly Val
870 875 880

CAG ACT TCC AGC CCA GCA TGT AAA CAA GAG AAA GAC GAT AAG GAA GAG 2862
Gln Thr Ser Ser Pro Ala Cys Lys Gln Glu Lys Asp Asp Lys Glu Glu
885 890 895 900



AAG AAA GAC GCA GCT GAG CAA GTT AGG AAA T<m ACA TTG AAT CCC AAT 
Lys Lys Asp Ala Ala Glu Gln Val Arg Lys Ser Thr Leu Asn Pro Asn
905 910 915



2910 



GCA AAG GAG TTC AAC CCA CGT TCC TTC TCT CAG CCA AAG CCT TCT ACT 2958
Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gln Pro Lys Pro Ser Thr
920 925 930

ACC CCA ACT TCA CCT CGG CCT CAA GCA CAA CCT AGC CCA TCT ATG GTG 3006
Thr Pro Thr Ser Pro Arg Pro Gln Ala Gln Pro Ser Pro Ser Met Val
935 940 945

GGT CAT CAA CAG CCA ACT CCA GTT TAT ACT CAG CCT GTT TGT TTT GCA 3054
Gly His Gln Gln Pro Thr Pro Val Tyr Thr Gln Pro Val Cys Phe Ala
950 955 960



CCA AAT ATG ATG TAT CCA GTC CCA GTG AGC CCA GGC GTG CAA CCT TTA 3102
Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gln Pro Leu
965 970 975 980

TAC CCA ATA CCT ATG ACG CCC ATG CCA GTG AAT CAA GCC AAG ACA TAT 3150
Tyr Pro Ile Pro Met Thr Pro Met Pro Val Asn Gln Ala Lys Thr Tyr
985 990 995

AGA GCA GTA CCA AAT ATG CC CAA CAG CGG CAA GAC CAG CAT CAT CAG 3198
Arg Ala Val Pro Asn Met Pro Gln Gln Arg Gln Asp Gln His His Gln
1000 1005 1010

AGT GCC ATG ATG CAC CCA GCd TCA GCA GCG GGC CCA CCG ATT GCA GCC 3246
Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro Pro Ile Ala Ala
1015 1020 1025

ACC CCA CCA GCT TAC TCC ACG TAT GTT GCC TAC AGT CCT CAG CAG 3294
Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr Ser Pro Gln Gln
1030 1035 1040

TTC CCA AAT CAG CCC CTT GTT CAdL CAT GTG CCA CAT TAT CAG TCT CAG 3342
Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His Tyr Gln Ser Gln
1045 1050 1055 1060

CAT CCT CAT GTC TAT AGT CCT GTA ATA CAG GGT AAT GCT AGA ATG ATG 3390
His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn Ala Arg Met Met
1065 1070 1075

GCA CCA CCA ACA CAC GCC CAG CCT Gg\ TTA GTA TCT TCT TCA GCA ACT 3438
Ala Pro Pro Thr His Ala Gln Pro Gly Leu Val Ser Ser Ser Ala Thr
1080 1085 1090

CAG TAC GGG GCT CAT GAG CAG ACG CAT GCG ATG TAT GCA TGT CCC AAA 3486
Gln Tyr Gly Ala His Glu Gln Thr His Ala Met Tyr Ala Cys Pro Lys
1095 1100 1105

TTA CCA TAC AAC AAG GAG ACA AGC CCT TcV TTC TAC TTT GCC ATT TCC 3534
Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr Phe Ala Ile Ser
1110 1115 1120

ACG GGC TCC CTT GCT CAG CAG TAT GCG CAC CCT AAC GCT ACC CTG CAC 3582
Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn Ala Thr Leu His
1125 1130 1135 1140

CCA CAT ACT CCA CAC CCT CAG CCT TCA GCT AcV CCC ACT GGA CAG CAG 3630
Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro Thr Gly Gln Gln
1145 1150 1155

CAA AGC CAA CAT GGT GGA AGT CAT CCT GCA CCC AGT CCT GTT CAG CAC 3678
Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser Pro Val Gln His
1160 1165 1170

CAT CAG CAC CAG GCC GCC CAG GCT CTC CAT CTG GCC AGT CCA CAG CAG 3726
His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala Ser Pro Gln Gln
1175 1180 1185



CAG TCA GCC ATT TAG CAC GCG GGG CTT GCG CCA ACT CCA CCC TCC ATG 3774
Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr Pro Pro Ser Met
1190 1195 1200

ACA CCT GCC TCC AAC ACG CAG TCG CCA CAG AAT AGT TTC CCA GCA GCA 3822
Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Asn Ser Phe Pro Ala Ala
1205 1210 1215 1220

CAA CAG ACT GTC TTT ACG ATC CAT CCT TCT CAC GTT CAG CCG GCG TAT 3870
Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val Gln Pro Ala Tyr
1225 1230 1235

ACC AAC CCA CCC CAC ATG GCC GTA CCT CAG GCT CAT GTA CAG TCA 3918
Thr Asn Pro Pro His Met Ala His Val Pro Gln Ala His Val Gln Ser
1240 1245 1250

GGA ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG CTA ATG 3966
Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met Leu Met
1255 1260 1265

ACG ACA CAG CCA CCC GGC GGT CCC GCC GCC CTC GCT CAA AGT GCA 4014
Thr Thr Gln Pro Pro Gly Gly Pro Ala Ala Leu Ala Gln Ser Ala
1270 1275 1280

CTA CAG CCC ATT CCA GTC TCG ACA ACA GCG CAT TTC CCC TAT ATG ACG 4062
Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe Pro Tyr Met Thr
1285 1290 1295 1300

CAC CCT TCA GTA CAA GCC CAC CAC CAA CAG CAG TTG TAAGGCTGCC 4108
His Pro Ser Val Gln Ala His His Gln Gln Gln Leu
1305 1310

CTGGAGGAAC CGAAAGGCCA AATTCCCTCC TCCCTTCTAC TGCTTCTACC AACTGGAAGC 4168
ACAGAAAACT AGAATTTCAT TTATTTTGTT TTTAAA;JtAT ATATGTTGAT TTCTTGTAAC 4228
ATCCAATAGG AATGCTAACA GTTCACTTGC AGTGGAAG^T ACTTGGACCG AGTAGAGGCA 4288
TTTAGGAACT TGGGGGCTAT TCCATAATTC CATATGCTGT TTCAGAGTCC CGCAGGTACC 4348
CCAGCTCTGC TTGCCGAAAC TGGAAGTTAT TTATTTTTTA ATAACCCTTG AAAGTCATGA 4408
ACACATCAGC TAGCAAAAGA AGTAACAAGA GTGATTCTTG CTGCTATTAC TGCTAAAA7VA 4468
AAAAAAAAAA AAA 4481



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Arg Ser Ala Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu
1 5 10 15

Ser Arg Arg Phe Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gln
20 25 30

Arg Pro Ala Arg Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly
35 40 45

Pro Tyr Pro Ser Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro
50 55 60

Ser Arg Gln Ser Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn
65 70 75 80

Gly Asn Gly Gly Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly
85 90 95

Leu Gly Gly Pro Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala
100 105 110

Ser Pro Gly Ala Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly
115 120 125

Ala Arg Ala Ser Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala
130 135 140

Pro Gly Cys Pro Arg Pro Ala Cys Glu Pro Val Tyr Gly Pro Leu Thr
145 150 155 160

Met Ser Leu Lys Pro Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln
165 170 175

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Pro Pro Pro Ala Ala
180 185 190

Ala Asn Val Arg Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala
195 200 205

Ala Ala Pro Ser Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr
210 215 220

Ala Pro Ser Ser Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly
225 230 235 240

Leu Gly Arg Gly Arg Asn Ser Asn Lys Gly Leu Pro Gln Ser Thr Ile
245 250 255

Ser Phe Asp Gly Ile Tyr Ala Asn Met Arg Met Val His Ile Leu Thr
260 265 270

Ser Val Val Gly Ser Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile
275 280 285



Tyr Glu Gly Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu
290 295 300

Asp Ala Ala His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg
305 310 315 320

Glu Glu Ile Met Glu Ser Ile Leu Phe Lys Cys Ser Asp Phe Val Val
325 330 335

Val Gln Phe Lys Asp Met Asp Ser Ser Tyr Ala Lys Arg Asp Ala Phe
340 345 350

Thr Asp Ser Ala Ile Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys
355 360 365

Asp Leu Glu Pro Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu
370 375 380

Glu Ala Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met
385 390 395 400

Phe Arg Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser
405 410 415

Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu
420 425 430

Phe Leu Lys Arg Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu
435 440 445

Ser Ser Ala Gln Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg
450 455 460



Ser Glu Glu Glu Lys Tyr Thr Ala Val Gln Arg Asn Ser Ser Glu Arg
465 470 475 480

Glu Gly His Ser Ile Asn Thr Arg Glu Asn Lys Tyr Ile Pro Pro Gly
485 490 495

Gln Arg Asn Arg Glu Val Ile Ser Trp Gly Ser Gly Arg Gln Asn Ser
500 505 510

Pro Arg Met Gly Gln Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr
515 520 525

Ser His Thr Ser Asp Phe Asn Pro Asn Ser Gly Ser Asp Gln Arg Val
530 535 540

Val Asn Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser
545 550 555 560

Arg Pro Pro Ser Arg Tyr Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg
565 570 575






Ala Ala Thr Pro Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser
580 585 590

Arg Pro Pro Ser His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser
595 600 605

Thr Met Pro Lys Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro
610 615 620

Lys Ala Gln Arg His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly
625 630 635 640

Ser Ile Ser Ser Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu
645 650 655

Ala Ala Thr Pro Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr Trp
660 665 670

Ser Ser Val Val Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg
675 680 685

Pro Arg Ser Pro Arg Gln Asn Ser Ile Gly Asn Thr Pro Ser Gly Pro
690 695 700

Val Leu Ala Ser Pro Gln Ala Gly Ile Ile Pro Thr Glu Ala Val Ala
705 710 715 720

Met Pro Ile Pro Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn
725 730 735

Arg Ala Val Thr Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gln Asp
740 745 750

Gln Arg Gln Asn Ser Pro Ala Gly Asn Lys Glu Asn Ile Lys Pro Asn
755 760 765

Glu Thr Ser Pro Ser Phe Ser Lys Ala Glu Asn Lys Gly Ile Ser Pro
770 775 780

Val Val Ser Glu His Arg Lys Gln Ile Asp Asp Leu Lys Lys Phe Lys
785 790 795 800

Asn Asp Phe Arg Leu Gln Pro Ser Ser Thr Ser Glu Ser Met Asp Gln
805 810 815

Leu Leu Asn Lys Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu Ile Lys
820 825 830

Asp Lys Ile Glu Pro Ser Ala Lys Asp Ser Phe Ile Glu Asn Ser Ser
835 840 845

Ser Asn Cys Thr Ser Gly Ser Ser Lys Pro Asn Ser Pro Ser Ile Ser
850 855 860



Pro Ser Ile Leu Ser Asn Thr Glu His Lys Arg Gly Pro Glu Val Thr
865 870 875 880

Ser Gln Gly Val Gln Thr Ser Ser Pro Ala Cys Lys Gln Glu Lys Asp
885 890 895

Asp Lys Glu Glu Lys Lys Asp Ala Ala Glu Gln Val Arg Lys Ser Thr
900 905 910

Leu Asn Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gln Pro
915 920 925

Lys Pro Ser Thr Thr Pro Thr Ser Pro Arg Pro Gln Ala Gln Pro Ser
930 935 940

Pro Ser Met Val Gly His Gln Gln Pro Thr Pro Val Tyr Thr Gln Pro
945 950 955 960

Val Cys Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly
965 970 975

Val Gln Pro Leu Tyr Pro Ile Pro Met Thr Pro Met Pro Val Asn Gln
980 985 990

Ala Lys Thr Tyr Arg Ala Val Pro Asn Met Pro Gln Gln Arg Gln Asp
995 1000 1005

Gln His His Gln Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro
1010 1015 1020

Pro Ile Ala Ala Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr
1025 1030 1035 1040

Ser Pro Gln Gln Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His
1045 1050 1055

Tyr Gln Ser Gln His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn
1060 1065 1070

Ala Arg Met Met Ala Pro Pro Thr His Ala Gln Pro Gly Leu Val Ser
1075 1080 1085

Ser Ser Ala Thr Gln Tyr Gly Ala His Glu Gln Thr His Ala Met Tyr
1090 1095 1100

Ala Cys Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr
1105 1110 1115 1120

Phe Ala Ile Ser Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn
1125 1130 1135

Ala Thr Leu His Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro
1140 1145 1150



Thr Gly Gln Gln Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser
1155 1160 1165

Pro Val Gln His His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala
1170 1175 1180

Ser Pro Gln Gln Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr
1185 1190 1195 1200

Pro Pro Ser Met Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Asn Ser
1205 1210 1215

Phe Pro Ala Ala Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val
1220 1225 1230

Gln Pro Ala Tyr Thr Asn Pro Pro His Met Ala His Val Pro Gln Ala
1235 1240 1245

His Val Gln Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro
1250 1255 1260

Met Met Leu Met Thr Thr Gln Pro Pro Gly Gly Pro Gln Ala Ala Leu
1265 1270 1275 1280

Ala Gln Ser Ala Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe
1285 1290 1295

Pro Tyr Met Thr His Pro Ser Val Gln Ala His His Gln Gln Gln Leu
1300 1305 1310



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3798 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 50..3457



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

GGCACGAGGT CCCCGCCCGG CGTGCGAGCC GGTGTA-iGGG CCGCTCACC ATG TCG 55
Met Ser

CTG AAG CCG CAG CCG CAG CCG CCC GCG CCC GCC ACT GGC CGC AAG CCC 103 
Leu Lys Pro Gln Pro Gln Pro Pro Ala Pro Ala Thr Gly Arg Lys Pro
5 10 15 

GGC GGC GGC CTG CTC TCG TCG CCC GGC GCC GCG CCG GCC TCG GCC GCG 151 
Gly Gly Gly Leu Leu Ser Ser Pro Gly Ala Ala Pro Ala Ser Ala Ala 
20 25 30 

GTG ACC TCG GCT TCC GTG GTG CCG GCC CCG GCC GCG CCG GTG GCG TCT 199 
Val Thr Ser Ala Ser Val Val Pro Ala Pro Ala Ala Pro Val Ala Ser 
35 40 45 50 

TCC TCG GCG GCC GCG GGC GGC GGG CGT CCC GGC CTG GGC AGA GGT CGG 247 
Ser Ser Ala Ala Ala Gly Gly Gly Arg Pro Gly Leu Gly Arg Gly Arg 
55 60 65 

AAC AGT AGC AAA GGA CTG CCT CAG CCT ACG ATT TCT TTT GAT GGA ATC 295 
Asn Ser Ser Lys Gly Leu Pro Gln Pro Thr Ile Ser Phe Asp Gly Ile
70 75 80 

TAT GCA AAC GTG AGG ATG GTT CAT ATA CTT ACG TCA GTT GTT GGA TCG 343 
Tyr Ala Asn Val Arg Met Val His Ile Leu Thr Ser Val Val Gly Ser
85 90 95 

AAA TGT GAA GTA CAA GTG AAA AAC GGA GGC ATA TAT GAA GGA GTT TTT 391 
Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile Tyr Glu Gly Val Phe
100 105 110 

AAA ACA TAC AGT CCT AAG TGT GAC TTG GTA CTT GAT GCT GCA CAT GAG 439 
Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala Ala His Glu 
115 120 125 130 

AAA AGT ACA GAA TCC AGT TCG GGG CCA AAA CGT GAA GAA ATA ATG GAG 487 
Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu Ile Met Glu
135 140 145 

AGT GTT TTG TTC AAA TGC TCA GAC TTC GTT GTG GTA CAG TTT AAA GAT 535 
Ser Val Leu Phe Lys Cys Ser Asp Phe Val Val Val Gln Phe Lys Asp
150 155 160 

ACA GAC TCC AGT TAT GCA CGG AGA GAT GCT TTT ACT GAC TCT GCT CTC 583 
Thr Asp Ser Ser Tyr Ala Arg Arg Asp Ala Phe Thr Asp Ser Ala Leu

165 170 175 

AGC GCA AAG GTG AAT GGT GAG CAC AAG GAG AAG GAC CTG GAG CCC TGG 631 
Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu Glu Pro Trp 
180 185 190 

GAT GCA GGG GAG CTC ACG GCC AGC GAG GAG CTG GAG CTG GAG AAT GAT 679 
Asp Ala Gly Glu Leu Thr Ala Ser Glu Glu Leu Glu Leu Glu Asn Asp 
195 200 205 210 

GTG TCT AAT GGA TGG GAC CCC AAT GAC ATG TTT CGA TAT AAT GAA GAG 727 
Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn Glu Glu 
215 220 225 
AAT TAT GGT GTG GTG TCC ACA TAT GAT AGC AGT TTA TCT TCA TAT ACG 775
Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser Tyr Thr 
230 235 240 

GTT CCT TTA GAA AGG GAG AAC TCA GAA GAA TTT CTT AAA CGG GAG GCA 823 
Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg Glu Ala 
245 250 255 

AGG GCA AAC CAG TTA GCA GAA GAA ATT GAA TCC AGT GCT CAG TAC AAA 871 
Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln Tyr Lys
260 265 270 

GCT CGT GTC GCC CTT GAG AAT GAT GAC CGG AGT GAG GAA GAA AAA TAC 919 
Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu Lys Tyr 
275 280 285 290 

ACA GCA GTC CAG AGA AAC TGC AGT GAC CGG GAG GGG CAT GGC CCC AAC 967 
Thr Ala Val Gln Arg Asn Cys Ser Asp Arg Glu Gly His Gly Pro Asn
295 300 305 

ACT AGG GAC AAT AAA TAT ATT CCT CCT GGA CAA AGA AAC AGA GAA GTC 1015 
Thr Arg Asp Asn Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg Glu Val
310 315 320 

CTA TCC TGG GGA AGT GGG AGA CAG AGC TCA CCA CGG ATG GGC CAG CCT 1063 
Leu Ser Trp Gly Ser Gly Arg Gln Ser Ser Pro Arg Met Gly Gln Pro
325 330 335 

GGG CCA GGC TCC ATG CCG TCA AGA GCT GCT TCT CAC ACT TCA GAT TTC 1111
Gly Pro Gly Ser Met Pro Ser Arg Ala Ala Ser His Thr Ser Asp Phe 
340 345 350 

AAC CCG AAC GCT GGC TCA GAC CAA AGA GTA GTT AAT GGA GGT GTT CCC 1159 
Asn Pro Asn Ala Gly Ser Asp Gln Arg Val Val Asn Gly Gly Val Pro
355 360 365 370 

TGG CCA TCG CCT TGC CCA TCT CAT TCC TCT CGC CCA CCT TCT CGC TAC 1207 
Trp Pro Ser Pro Cys Pro Ser His Ser Ser Arg Pro Pro Ser Arg Tyr 
375 380 385 

CAG TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC ACC CAT ACA CGG 1255 
Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala Thr His Thr Arg

390 395 400 

CCG CCC TCC AGG CCC CCC TCG AGG CCA TCC AGA CCC CCG TCT CAC CCC 1303 
Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro Pro Ser His Pro 
405 410 415 

TCT GCT CAT GGT TCT CCA GCT CCT GTC TCT ACT ATG CCT AAA CGC ATG 1351 
Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met Pro Lys Arg Met 
420 425 430 

TCT TCA GAA GGA CCC CCA AGG ATG TCT CCA AAG GCA CAG CGC CAC CCT 1399
Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala Gln Arg His Pro
435 440 445 450 
CGG AAT CAC AGA GTC TCT GCT GGG AGA GGC TCC ATG TCT AGT GGC CTA 1447
Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser Met Ser Ser Gly Leu 
455 460 465 

GAA TTT GTA TCC CAC AAT CCC CCA AGT GAA GCA GCT GCT CCT CCA GTG 1495 
Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala Ala Pro Pro Val 
470 475 480 

GCA AGG ACC AGT CCT GCA GGG GGA ACG TGG TCC TCA GTG GTC AGT GGG 1543 
Ala Arg Thr Ser Pro Ala Gly Gly Thr Trp Ser Ser Val Val Ser Gly 
485 490 495 

GTT CCA AGG TTA TCT CCC AAA ACT CAC AGA CCC AGG TCT CCC AGG CAG 1591 
Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg Ser Pro Arg Gln
500 505 510 

AGC AGC ATT GGA AAC TCT CCC AGC GGG CCT GTG CTT GCT TCT CCC CAA 1639 
Ser Ser Ile Gly Asn Ser Pro Ser Gly Pro Val Leu Ala Ser Pro Gln
515 520 525 530 

GCT GGC ATC ATC CCT GCA GAA GCC GTT TCC ATG CCT GTT CCC GCC GCA 1687 
Ala Gly Ile Ile Pro Ala Glu Ala Val Ser Met Pro Val Pro Ala Ala
535 540 545 

TCT CCG ACT CCT GCC AGC CCT GCA TCC AAC AGA GCA CTG ACC CCA TCT 1735

Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala Leu Thr Pro Ser 
550 555 560 

ATT GAG GCA AAA GAT TCC AGG CTT CAA GAT CAG AGG CAG AAC TCT CCT 1783 
Ile Glu Ala Lys Asp Ser Arg Leu Gln Asp Gln Arg Gln Asn Ser Pro
565 570 575 

GCA GGG AGT AAA GAA AAT GTT AAA GCA AGT GAA ACA TCA CCT AGC TTT 1831 
Ala Gly Ser Lys Glu Asn Val Lys Ala Ser Glu Thr Ser Pro Ser Phe 
580 585 590 

TCA AAA GCT GAC AAC AAA GGT ATG TCA CCA GTT GTT TCT GAA CAC AGA 1879 
Ser Lys Ala Asp Asn Lys Gly Met Ser Pro Val Val Ser Glu His Arg 
595 600 605 610 

AAA CAG ATT GAT GAC TTA AAG AAG TTT AAG AAT GAT TTT AGG TTA CAG 1927 
Lys Gln Ile Asp Asp Leu Lys Lys Phe Lys Asn Asp Phe Arg Leu Gln

615 620 625 

CCA AGC TCT ACA TCT GAA TCT ATG GAT CAA CTA CTA AGC AAA AAT AGA 1975 
Pro Ser Ser Thr Ser Glu Ser Met Asp Gln Leu Leu Ser Lys Asn Arg
630 635 640 

GAA GGA GAA AAG TCA CGA GAT TTG ATT AAA GAT AAA ACG GAA GCA AGT 2023 
Glu Gly Glu Lys Ser Arg Asp Leu Ile Lys Asp Lys Thr Glu Ala Ser
645 650 655 

GCT AAG GAT AGT TTC ATT GAC AGG AGC AGC AGC AGC AGC AAC TGT ACC 2071
Ala Lys Asp Ser Phe Ile Asp Ser Ser Ser Ser Ser Ser Asn Cys Thr
660 665 670



AGT GGC AGC AGC AAQ^ ACC AAC AGC CCT AGC ATC TCC CCT TCC ATG CTT 2119
Ser Gly Ser Ser Lys Thr Asn Ser Pro Ser Ile Ser Pro Ser Met Leu
675 680 685 690

AGT AAT GCA GAG CAC AAG AGG GGG CCT GAG GTC ACA TCC CAA GGG GTG 2167
Ser Asn Ala Glu His Lys Arg Gly Pro Glu Val Thr Ser Gln Gly Val
695 700 705

CAG ACT TCC AGC CCA GCA TGC AAA CAA GAG AAG GAT GAC AGA GAA GAG 2215
Gln Thr Ser Ser Pro Ala Cys Lys Gln Glu Lys Asp Asp Arg Glu Glu
710 715 720

AAG AAA GAC ACA ACA GAG GTT AGG AAA TCG ACA TTG AAT CCC AAT 2263
Lys Lys Asp Thr Thr Glu Val Arg Lys Ser Thr Leu Asn Pro Asn
725 730 735

GCA AAG GAG TTC AAC CCT CGT TCT TTC TCT CAG CCA AAG CCT TCT ACT 2311
Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gln Pro Lys Pro Ser Thr
740 745 750

ACC CCA ACG TCA CCT CGG CCT GCA CAA CCC AGC CCA TCT ATG GTG 2359
Thr Pro Thr Ser Pro Arg Pro Ala Gln Pro Ser Pro Ser Met Val
755 760 765 770

GGT CAT CAG CAG CCA GCT CCA GTG ^C ACT CAG CCT GTG TGC TTC GCA 2407
Gly His Gln Gln Pro Ala Pro Val Thr Gln Pro Val Cys Phe Ala
775 780 785

CCC AAT ATG ATG TAT CCC GTC CCA GTG AGC CCG GGC GTA CAA CCT TTA 2455
Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gln Pro Leu
790 795 800

TAC CCA ATA CCT ATG ACG CCC ATG CCT GtIb AAC CAA GCC AAG ACA TAT 2503
Tyr Pro Ile Pro Met Thr Pro Met Pro Val Asn Gln Ala Lys Thr Tyr
805 810 815

AGA GCA GGT AAA GTA CCA AAT ATG CCC CAA CAG CGA CAA GAC CAA CAT 2551
Arg Ala Gly Lys Val Pro Asn Met Pro Gln Gln Arg Gln Asp Gln His
820 825 830

CAT CAA AGC ACC ATG ATG CAC CCA GCC TCC GCA GCA GGG CCA CCC ATC 2599
His Gln Ser Thr Met Met His Pro Ala Ser Ala Ala Gly Pro Pro Ile
835 840 845 850

GTA GCC ACC CCG CCC GCT TAC TCC ACT CAG TAC GTT GCC TAC AGC CCT 2647
Val Ala Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr Ser Pro
855 860 865

CAG CAG TTT CCC AAT CAG CCT TTG GTC CAG CAT GTG CCG CAT TAT CAG 2695
Gln Gln Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His Tyr Gln
870 875 880

TCT CAG CAT CCT CAT GTG TAC AGT CCT GTC ATA CAA GGT AAT GCC AGG 2743
Ser Gln His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn Ala Arg
885 890 895

ATG ATG GCA CCA CCA GCA CAT GCT CAG CCT GGT TTA GTG TCT TCT TCA 2791
Met Met Ala Pro Pro Ala His Ala Gln Pro Gly Leu Val Ser Ser Ser
900 905 910

GCT GCT CAG TTC GGG GCT CAC GAG CAG ACG CAC GCC ATG TAT GCA TGT 2839
Ala Ala Gln Phe Gly Ala His Glu Gln Thr His Ala Met Tyr Ala Cys
915 920 925 930

CCC AAA TTA CCA TAC AAC AAG GAG ACA AGC CCT TCT TTC TAC TTT GCC 2887
Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr Phe Ala
935 940 945

ATT TCC ACC GGC TCC CTC GCT CAG CAG TAT GCA CAT CCT AAT GCC GCC 2935
Ile Ser Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn Ala Ala
950 955 960

CTG CAT CCA CAT ACT CCC CAT CCT CAG CCT TCG GCC ACT CCC ACC GGA 2983
Leu His Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro Thr Gly
965 970 975

CAG CAG CAA AGC CAG CAT GGT GGA AGT CAC CCT GCA CCC AGT CCT GTT 3031
Gln Gln Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser Pro Val
980 985 990

CAG CAC CAT CAG CAC CAG GCT GCC CAG GCT CTT CAT CTG GCC AGT CCA 3079
Gln His His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala Ser Pro
995 1000 1005 1010

CAG CAG CAG TCG GCC ATT TAT CAT GCG GGG CTG GCA CCA ACA CCA CCT 3127
Gln Gln Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr Pro Pro
1015 1020 1025

TCC ATG ACA CCT GCC TCT AAT ACA CAG TCT CCA CAG AGC AGT TTC CCA 3175
Ser Met Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Ser Ser Phe Pro
1030 1035 1040

GCA GCA CAA CAG ACA GTC TTC ACC ATC CAC CCT TCT CAT GTT CAG CCG 3223
Ala Ala Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val Gln Pro
1045 1050 1055

GCA TAC ACC ACC CCA CCC CAC ATG GCC CAC GTA CCT CAG GCT CAT GTA 3271
Ala Tyr Thr Thr Pro Pro His Met Ala His Val Pro Gln Ala His Val
1060 1065 1070

CAG TCA GGA ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG 3319
Gln Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met
1075 1080 1085 1090

CTA ATG ACG ACA CAG CCA CCC GGT CCC AA^ GCC GCC CTC GCT CAA AGT 3367 
Leu Met Thr Thr Gln Pro Pro Gly Pro Lys Ala Ala Leu Ala Gln Ser
1095 1100 1105

GCA CTA CAG CCC ATT CCA GTT TCG ACA ACA GCG CAT TTC CCT TAT ATG 3415
Ala Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe Pro Tyr Met
1110 1115 1120

ACG CAC CCT TCA GTA CAA GCC CAC CAC CAA CAG CAG TTG TAAGGCTGCC 3464
Thr His Pro Ser Val Gln Ala His His Gln Gln Gln Leu
1125 1130 1135

TTGGAGGAAC CGAAAGGCdA AATCCCTTCT TCCCTTCTCT GCTTCTGCCA ACCGGAAGCA 3524
CAGAAAACTA GAACTTCATT GATTTTGTTT TTTAAAAGAT ACACTGATTT AACATCTGAT 3584
AGGAATGCTA ACAGCTCACT TGCAGTGGAG GATGTTTTGG ACCGAGTAGA GGCATGTAGG 3644
GACTTGTGGC TGTTCCATAA '^r^CCATGTGC TGTTGGAGGG TCCTGCAAGT ACCCAGCTCT 3704
GCTTGCTGAA ACTGGAAGTT ATOTATTTTT TAATGGCCCT TGAGAGTCAT GAACACATCA 3764
GCTAGCAACA GAAGTAACAA GAGTGATTCT TGCT 3798



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



Met Ser Leu Lys Pro Gln Pro Gln Ptp Pro Ala Pro Ala Thr Gly Arg
1 5 10 15

Lys Pro Gly Gly Gly Leu Leu Ser Ser Pro Gly Ala Ala Pro Ala Ser
20 25 30

Ala Ala Val Thr Ser Ala Ser Val Val Pro Ala Pro Ala Ala Pro Val
35 40 45

Ala Ser Ser Ser Ala Ala Ala Gly Gly Gly Arg Pro Gly Leu Gly Arg
50 55 60

Gly Arg Asn Ser Ser Lys Gly Leu Pro
65 70

Pro Thr Ile Ser Phe Asp
75 80

Gly Ile Tyr Ala Asn Val Arg Met Val His Ile Leu Thr Ser Val Val
85 90 95

Gly Ser Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile Tyr Glu Gly
100 105 110

Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala Ala
115 120 125

His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu Ile
130 135 140




Met Glu Ser Val Leu Phe Lys Cys Ser Asp Phe Val Val Val Gln Phe
145 150 155 160

Lys Asp Thr Asp Ser Ser Tyr Ala Arg Arg Asp Ala Phe Thr Asp Ser
165 170 175

Ala Leu Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu Glu
180 185 190

Pro Trp Asp Ala Gly Glu Leu Thr Ala Ser Glu Glu Leu Glu Leu Glu
195 200 205

Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn
210 215 220

Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser
225 230 235 240

Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg
245 250 255

Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln
260 265 270

Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu
275 280 285

Lys Tyr Thr Ala Val Gln Arg Asn Cys Ser Asp Arg Glu Gly His Gly
290 295 300

Pro Asn Thr Arg Asp Asn Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg
305 310 315 320

Glu Val Leu Ser Trp Gly Ser Gly Arg Gln Ser Ser Pro Arg Met Gly
325 330 335

Gln Pro Gly Pro Gly Ser Met Pro Ser Arg Ala Ala Ser His Thr Ser
340 345 350

Asp Phe Asn Pro Asn Ala Gly Ser Asp Gln Arg Val Val Asn Gly Gly
355 360 365

Val Pro Trp Pro Ser Pro Cys Pro Ser Ho^ Ser Ser Arg Pro Pro Ser 
370 375 380

Arg Tyr Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala Thr His
385 390 395 400

Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro Pro Ser
405 410 415

His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met Pro Lys
420 425 430

Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala Gln Arg
435 440 445

His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser Met Ser Ser
450 455 460

Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala Ala Pro
465 470 475 480

Pro Val Ala Arg Thr Ser Pro Ala Gly Gly Thr Trp Ser Ser Val Val
485 490 495

Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg Ser Pro
500 505 510



Arg Gln Ser Ser Ile Gly Asn Ser Pro Ser Gly Pro Val Leu Ala Ser
515 520 525

Pro Gln Ala Gly Ile Ile Pro Ala Glu Ala Val Ser Met Pro Val Pro
530 535 540

Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala Leu Thr
545 550 555 560

Pro Ser Ile Glu Ala Lys Asp Ser Arg Leu Gln Asp Gln Arg Gln Asn
565 570 575

Ser Pro Ala Gly Ser Lys Glu Asn Val Lys Ala Ser Glu Thr Ser Pro
580 585 590

Ser Phe Ser Lys Ala Asp Asn Lys Gly Met Ser Pro Val Val Ser Glu
595 600 605

His Arg Lys Gln Ile Asp Asp Leu Lys dys Phe Lys Asn Asp Phe Arg
610 615 620

Leu Gln Pro Ser Ser Thr Ser Glu Ser Met Asp Gln Leu Leu Ser Lys
625 630 635 640

Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu Ile Lys Asp Lys Thr Glu
645 650 655

Ala Ser Ala Lys Asp Ser Phe Ile Asp Ser Ser Ser Ser Ser Ser Asn
660 665 670

Cys Thr Ser Gly Ser Ser Lys Thr Asn Ser Pro Ser Ile Ser Pro Ser
675 680 685

Met Leu Ser Asn Ala Glu His Lys Arg Gly Pro Glu Val Thr Ser Gln
690 695 700

Gly Val Gln Thr Ser Ser Pro Ala Cys Lys Gln Glu Lys Asp Asp Arg
705 710 715 720



Glu Glu Lys Lys Asp Thr Thr Glu Gln Val Arg Lys Ser Thr Leu Asn
725 730 735

Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gln Pro Lys Pro
740 745 750

Ser Thr Thr Pro Thr Ser Pro Arg Pro Gln Ala Gln Pro Ser Pro Ser
755 760 765

Met Val Gly His Gln Gln Pro Ala Pro Val Tyr Thr Gln Pro Val Cys
770 775 780

Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gln
785 790 795 800

Pro Leu Tyr Pro Ile Pro Met Thr Pro Met Pro Val Asn Gln Ala Lys
805 810 815

Thr Tyr Arg Ala Gly Lys Val Pro Asn Met Pro Gln Gln Arg Gln Asp
820 825 830

Gln His His Gln Ser Thr Met Met His Pro Ala Ser Ala Ala Gly Pro
835 840 845

Pro Ile Val Ala Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr
850 855 860

Ser Pro Gln Gln Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His
865 870 875 880

Tyr Gln Ser Gln His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn
885 890 895

Ala Arg Met Met Ala Pro Pro Ala His Ala Gln Pro Gly Leu Val Ser
900 905 910

Ser Ser Ala Ala Gln Phe Gly Ala His Glu Gln Thr His Ala Met Tyr
915 920 925

Ala Cys Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr
930 935 940

Phe Ala Ile Ser Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn
945 950 955 960

Ala Ala Leu His Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro
965 970 975

Thr Gly Gln Gln Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser
980 985 990



Pro Val Gln His His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala
995 1000 1005

Ser Pro Gln Gln Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr
1010 1015 1020

Pro Pro Ser Met Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Ser Ser
1025 1030 1035 1040

Phe Pro Ala Ala Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val
1045 1050 1055

Gln Pro Ala Tyr Thr Thr Pro Pro His Met Ala His Val Pro Gln Ala
1060 1065 1070

His Val Gln Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro
1075 1080 1085

Met Met Leu Met Thr Thr Gln Pro Pro Gly Pro Lys Ala Ala Leu Ala
1090 1095 1100

Gln Ser Ala Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe Pro
1105 1110 1115 1120

Tyr Met Thr His Pro Ser Val Gln Ala His His Gln Gln Gln Leu
1125 1130 1135



